RAW SEQUENCE LISTING
ERROR REPORT




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 09/623,568A

Source: 1600

Date Processed by STIC: [illegible]

[Stamp: RECEIVED, JUL (day and year illegible), TECH CENTER 1600/2900]

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS.

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216.
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax)
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker

Applicants submitting genetic sequence information electronically on diskette or CD-ROM should be aware that there is a possibility that the disk/CD-ROM may have been affected by treatment given to all incoming mail. Please consider using alternate methods of submission for the disk/CD-ROM or replacement disk/CD-ROM.

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses:

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission User Manual - ePAVE)

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7th Floor, Examiner Name,
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202
Or

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two,
2011 South Clark Place, Arlington, VA 22202

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office,
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 2011 South Clark Place, Arlington, VA 22202
Raw Sequence Listing Error Summary

ERROR DETECTED          SUGGESTED CORRECTION          SERIAL NUMBER: 09/623,568A

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE

1  Wrapped Nucleics / Wrapped Aminos
   The number/text at the end of each line "wrapped" down to the next line. This may occur if your file
   was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will
   prevent "wrapping."



2  Invalid Line Length
   The rules require that a line not exceed 72 characters in length. This includes white spaces.
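
The 72-character limit can be checked before a file is submitted. The following is a minimal sketch, not the Office's Checker program; the function name `find_long_lines` and the sample text are hypothetical and only illustrate the rule stated above.

```python
MAX_LINE_LENGTH = 72  # limit stated in the report; white space counts


def find_long_lines(text, limit=MAX_LINE_LENGTH):
    """Return (line_number, length) for every line longer than `limit`."""
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > limit:
            problems.append((number, len(line)))
    return problems


# Hypothetical sample: line 2 is 73 characters long, line 3 is exactly 72.
sample = "<210> 1\n" + "a" * 73 + "\n" + "c" * 72
print(find_long_lines(sample))  # [(2, 73)]
```

Lines that a word processor has wrapped usually show up here as an over-long line followed by a short orphaned line holding the residue count.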



3  Misaligned Amino Numbering
   The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers;
   use space characters, instead.
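
Tab codes are invisible in most editors, so a quick scan helps locate them. This is a small sketch under the assumption that any tab character in the file is unwanted; `find_tab_lines` is a hypothetical helper, not part of PatentIn or Checker.

```python
def find_tab_lines(text):
    """Return the 1-based numbers of lines that contain a tab character."""
    return [
        number
        for number, line in enumerate(text.splitlines(), start=1)
        if "\t" in line
    ]


# Hypothetical sample: the numbering line (line 2) was typed with tabs.
sample = "Met Gly Gly Glu Gln\n1\t\t\t\t5\n"
print(find_tab_lines(sample))  # [2]
```

Each reported line should then be retyped with space characters so that the numbers sit under the intended residues.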

4  Non-ASCII
   The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please
   ensure your subsequent submission is saved in ASCII text.
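
One way to confirm that a file is plain ASCII is to look for bytes above the 7-bit range. The sketch below assumes the file has been read as raw bytes; `find_non_ascii` and the sample string are hypothetical.

```python
def find_non_ascii(data: bytes):
    """Return (offset, byte_value) for each byte outside 7-bit ASCII."""
    return [(offset, value) for offset, value in enumerate(data) if value > 0x7F]


# Hypothetical sample: a word processor replaced a hyphen with an en dash,
# which UTF-8 encodes as three non-ASCII bytes.
sample = "<223> Peptide \u2013 synthetic".encode("utf-8")
print([offset for offset, _ in find_non_ascii(sample)])  # [14, 15, 16]
```

For a real file, the bytes would come from `open(path, "rb").read()`; an empty result means every byte is within the ASCII range.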

5  Variable Length
   Sequence(s) ____ contain n's or Xaa's representing more than one residue. Per Sequence Rules,
   each n or Xaa can only represent a single residue. Please present the maximum number of each
   residue having variable length and indicate in the <220>-<223> section that some may be missing.

6  PatentIn 2.0 "bug"
   A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
   sequence(s) ____. Normally, PatentIn would automatically generate this section from the
   previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to
   the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for
   Artificial or Unknown sequences.



7  Skipped Sequences (OLD RULES)
   Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence:

   (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
   (i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
   This sequence is intentionally skipped

   Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.



8  Skipped Sequences (NEW RULES)
   Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence.

   <210> sequence id number
   <400> sequence id number
   000

9  Use of n's or Xaa's (NEW RULES)
   Use of n's and/or Xaa's have been detected in the Sequence Listing.
   Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.
   In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.

10 Invalid <213> Response
   Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or
   scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or
   is Artificial Sequence.

11 Use of <220>
   Sequence(s) ____ missing the <220> "Feature" and associated numeric identifiers and responses.
   Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or
   "Unknown." Please explain source of genetic material in <220> to <223> section.
   (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

12 PatentIn 2.0 "bug"
   Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file,
   resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence
   listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk.

13 Misuse of n
   n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent
   any value not specifically a nucleotide.
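
The two <220>-<223> requirements described above (an explanation is mandatory when <213> is Artificial Sequence or Unknown, and when n or Xaa is present) lend themselves to a simple pre-submission check. The sketch below is illustrative only and is not how the Office's software works; the helper names `split_records` and `check_record`, the problem messages, and the sample listing are all hypothetical. It accepts numeric identifiers with or without the English "alpha" headers.

```python
import re


def split_records(listing):
    """Group listing lines into records, each starting at a <210> line."""
    records = []
    for line in listing.splitlines():
        if line.lstrip().startswith("<210>"):
            records.append([])
        if records:
            records[-1].append(line)
    return records


def check_record(lines):
    """Return (seq_id, problems) for one sequence record."""
    text = "\n".join(lines)
    seq_id = int(re.search(r"<210>\D*(\d+)", text).group(1))
    match = re.search(r"<213>\s*(?:ORGANISM\s*:)?\s*(.*)", text)
    organism = match.group(1).strip().lower() if match else ""
    has_explanation = "<220>" in text and "<223>" in text
    is_protein = re.search(r"<212>.*\bPRT\b", text) is not None

    # Residue lines are everything after the <400> line.
    start = next((i for i, l in enumerate(lines) if "<400>" in l), len(lines) - 1)
    residues = " ".join(lines[start + 1:])
    if is_protein:
        has_unknown = re.search(r"\bXaa\b", residues) is not None
    else:
        has_unknown = "n" in residues

    problems = []
    if organism in ("artificial sequence", "unknown") and not has_explanation:
        problems.append("<213> is Artificial Sequence or Unknown but <220>-<223> is missing")
    if has_unknown and not has_explanation:
        problems.append("n or Xaa present but <220>-<223> is missing")
    return seq_id, problems


# Hypothetical two-record listing used only to exercise the checks.
sample = """
<210> 1
<211> 3
<212> PRT
<213> Artificial Sequence
<400> 1
Gly Xaa Met
1
<210> 2
<211> 8
<212> DNA
<213> Homo sapiens
<400> 2
acgtacgt 8
"""

for seq_id, problems in (check_record(r) for r in split_records(sample)):
    print(seq_id, problems or "ok")
```

The check only confirms that a <220>-<223> section exists; whether its text adequately explains the location and identity of each n or Xaa still has to be reviewed by hand.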



AMC/MH - Biotechnology Systems Branch - 08/21/2001 
Does Not Comply
Corrected Diskette Needed

1600

RAW SEQUENCE LISTING

PATENT APPLICATION: US/09/623,568A

DATE: 06/14/2002
TIME: 15:10:55

Input Set : A:\EP.txt

Output Set: N:\CRF3\06142002\I623568A.raw



3 <110> APPLICANT: Miller, Barbara
4       Osmani, Stephen
5       Clawson, Gary
6       Zhang, Min-Ying
7       Norris, James
9 <120> TITLE OF INVENTION: Use of Human Homolog Of A Nuclear Migration Gene For Treatment And
10      Diagnosis Of Cancer
12 <130> FILE REFERENCE: PSU-a016
14 <140> CURRENT APPLICATION NUMBER: 09/623,568A
15 <141> CURRENT FILING DATE: 2001-03-23
17 <150> PRIOR APPLICATION NUMBER: 60/076,885
18 <151> PRIOR FILING DATE: 1998-03-05
20 <150> PRIOR APPLICATION NUMBER: PCT US99/04996
21 <151> PRIOR FILING DATE: 1999-03-05
23 <160> NUMBER OF SEQ ID NOS: 16
25 <170> SOFTWARE: PatentIn version 3.1
27 <210> SEQ ID NO: 1
28 <211> LENGTH: 14
29 <212> TYPE: PRT
30 <213> ORGANISM: artificial Sequence
32 <220> FEATURE:
33 <223> OTHER INFORMATION:
   [handwritten annotation, illegible]
35 <400> SEQUENCE: 1
37 Gly Cys Met Val Glu Lys Met Met Tyr Asp Gln Arg Gln Lys
38 1               5                   10

41 <210> SEQ ID NO: 2
42 <211> LENGTH: 15
43 <212> TYPE: PRT
44 <213> ORGANISM: artificial Sequence
46 <220> FEATURE:
47 <223> OTHER INFORMATION: Peptide
49 <400> SEQUENCE: 2
51 Asn Gly Ser Leu Asp Ser Pro Gly Lys Gln Asp Thr Glu Glu Asp
52 1               5                   10                  15
55 <210> SEQ ID NO: 3
56 <211> LENGTH: 24
57 <212> TYPE: DNA
58 <213> ORGANISM: artificial Sequence
60 <220> FEATURE:
61 <223> OTHER INFORMATION: Oligonucleotide
63 <400> SEQUENCE: 3
64 ttctgttcgt ctgaagttgg cagc 24
67 <210> SEQ ID NO: 4



68 <211> LENGTH: 24

69 <212> TYPE: DNA

70 <213> ORGANISM: artificial Sequence

72 <220> FEATURE: 

73 <223> OTHER INFORMATION: Oligonucleotide 

75 <400> SEQUENCE: 4 

76 caatgaagtg aaggtggagg agag 24 

79 <210> SEQ ID NO: 5 

80 <211> LENGTH: 38 

81 <212> TYPE: DNA 

82 <213> ORGANISM: artificial Sequence 

84 <220> FEATURE: 

85 <223> OTHER INFORMATION: Oligonucleotide 
87 <400> SEQUENCE: 5

88 aaggtaccaa gatggactcc ccagggaagc aggatact 38 

91 <210> SEQ ID NO: 6 

92 <211> LENGTH: 32 

93 <212> TYPE: DNA

94 <213> ORGANISM: artificial Sequence 

96 <220> FEATURE: 

97 <223> OTHER INFORMATION: Oligonucleotide 

99 <400> SEQUENCE: 6 

100 aaggatccaa gaaagttggg tggttgcagc tc 32 

103 <210> SEQ ID NO: 7 

104 <211> LENGTH: 20 

105 <212> TYPE: DNA 

106 <213> ORGANISM: artificial Sequence 

108 <220> FEATURE: 

109 <223> OTHER INFORMATION: Oligonucleotide 

111 <400> SEQUENCE: 7 

112 gaaagtcgga ggttcgaaga 20 

115 <210> SEQ ID NO: 8 

116 <211> LENGTH: 20 

117 <212> TYPE: DNA 

118 <213> ORGANISM: artificial Sequence 

120 <220> FEATURE: 

121 <223> OTHER INFORMATION: Oligonucleotide 

123 <400> SEQUENCE: 8 

124 accaactaag aacggccatg 20 

127 <210> SEQ ID NO: 9 

128 <211> LENGTH: 24 

129 <212> TYPE: DNA 

130 <213> ORGANISM: artificial Sequence 

132 <220> FEATURE: 

133 <223> OTHER INFORMATION: Oligonucleotide 

135 <400> SEQUENCE: 9 

136 agcaacatgc cgtcgaaccg ctcc 24

139 <210> SEQ ID NO: 10 

140 <211> LENGTH: 24 

141 <212> TYPE: DNA 

142 <213> ORGANISM: artificial Sequence 

144 <220> FEATURE: 

145 <223> OTHER INFORMATION: Oligonucleotide 

147 <400> SEQUENCE: 10 

148 ggagcggttc gacggcatgt tgct 24 

151 <210> SEQ ID NO: 11 

152 <211> LENGTH: 1281 

153 <212> TYPE: DNA 

154 <213> ORGANISM: Homo sapiens 

156 <400> SEQUENCE: 11 

157 ctagagtgca gagctccggg acgtggatcg gagccggcgc gatgggcgga gagcaggagg 60 
159 aggagcggtt cgacggcatg ttgctggcca tggctcagca gcacgagggc ggcgtgcagg 120 
161 agcttgtgaa caccttcttc agcttccttc gacgcaaaac agactttttc attggaggag 180 
163 aagaagggat ggcagagaag cttatcacac agactttcag ccaccacaat cagctggcac 240 
165 agaagacccg gcgggagaag agagcccggc aggaggccga gcggcgggag aaggcggagc 300 
167 gggcggccag actggccaag gaagccaagt cagagacctc agggccccag atcaaggagc 360 
169 taactgatga agaggcagag aggctgcagc tagagattga ccagaaaaag gatgcagaga 420 
171 atcatgaggc ccagctcaag aacggcagcc ttgactcccc agggaagcag gatactgagg 480 
173 aagatgagga ggaagatgag aaggacaaag gaaaactgaa gcccaaccta ggcaacgggg 540 
175 cagacctgcc caattaccgc tggacccaga ccctgtcgga gctggacctg gcggtccctt 600 
177 tctgtgtgaa cttccggctg aaagggaagg acatggtggt ggacatccag cggcggcacc 660 
179 tccgggtggg gctcaagggg cagccagcga tcattgatgg ggagctctac aatgaagtga 720 
181 aggtggagga gagctcgtgg ctcattgagg acggcaaggt ggtgactgtg catctggaga 780 
183 agatcaataa gatggagtgg tggagccgct tggtgtccag tgaccctgag atcaacacca 840 
185 agaagattaa ccctgagaat tccaagctgt cagacctgga cagtgagact cgcagcatgg 900 
187 tggaaaagat gatgtatgac cagcgacaga agtccatggg gctgccaact tcagacgaac 960 
189 agaagaaaca ggagattctg aagaagttca tggatcaaca tccggagatg gatttttcca 1020 
191 aggctaaatt caactagccc ctgttttttc ctccctgaac tcttggggct gagctgcaac 1080 
193 cacccaactt tctttcccac tcttctctgg gacttgtggg cctcagggct tggggcaggc 1140 
195 atgggactgg cccaggcaca caggtcccgg ggcatcagga gaaaggctgg gtcttgggac 1200 
197 cttgtcctcc ccagttggcc tactgttaca cattaaaacg atttgcccag ctcaaaaaaa 1260 
199 aaaaaaaaaa aaaaaaaaaa a 1281 
202 <210> SEQ ID NO: 12

203 <211> LENGTH: 331

204 <212> TYPE: PRT 

205 <213> ORGANISM: Homo sapiens 
207 <400> SEQUENCE: 12 



209 Met Gly Gly Glu Gln Glu Glu Glu Arg Phe Asp Gly Met Leu Leu Ala
210 1               5                   10                  15
213 Met Ala Gln Gln His Glu Gly Gly Val Gln Glu Leu Val Asn Thr Phe
214             20                  25                  30
217 Phe Ser Phe Leu Arg Arg Lys Thr Asp Phe Phe Ile Gly Gly Glu Glu
218         35                  40                  45
221 Gly Met Ala Glu Lys Leu Ile Thr Gln Thr Phe Ser His His Asn Gln
222     50                  55                  60
225 Leu Ala Gln Lys Thr Arg Arg Glu Lys Arg Ala Arg Gln Glu Ala Glu
226 65                  70                  75                  80
229 Arg Arg Glu Lys Ala Glu Arg Ala Ala Arg Leu Ala Lys Glu Ala Lys
Ala
105 110
237 Glu Arg Leu Gln Leu Glu Ile Asp
Lys Asp Ala Glu Asn His
238 115 120 125
241 Glu Ala Gln
Gly Ser Leu Asp Ser Pro
Lys Gln Asp
242 135 140
245 Thr Glu Glu Asp Glu Glu Glu
Asp Glu Lys Asp Lys Gly Lys Leu Lys
246 145 150 155 160
249 Leu
Pro
Trp
280
<210> SEQ ID NO: 13
294 <211> LENGTH: 332
295 <212> TYPE: PRT
296 <213> ORGANISM: Rattus rattus
298 <400> SEQUENCE: 13
Met
30
Ser
Ile
Lys Leu Ile Thr Gln Thr Phe Asn His His Asn Gln
313
Met
<210> SEQ ID NO: 14
<211> LENGTH: 198
<212> TYPE: PRT
<213> ORGANISM: Aspergillus nidulans
<400> SEQUENCE: 14
Ser
Ile
Leu
Val
Gln Met
Trp Trp Ala
Val Val
Thr Ala
RAW SEQUENCE LISTING ERROR SUMMARY DATE: 06/14/2002

PATENT APPLICATION: US/09/623,568A TIME: 15:10:56

Input Set : A:\EP.txt

Output Set: N:\CRF3\06142002\I623568A.raw

Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa.

Seq#:16; Xaa Pos. 9,11
VERIFICATION SUMMARY DATE: 06/14/2002

PATENT APPLICATION: US/09/623,568A TIME: 15:10:56

Input Set : A:\EP.txt

Output Set: N:\CRF3\06142002\I623568A.raw

L:496 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:16 after pos.:0